Online Planning in Continuous POMDPs with Open-Loop Information-Gathering Plans
نویسنده
چکیده
This paper studies the convergence properties of a receding-horizon information-gathering strategy used in the recently presented RBSR planner for continuous POMDPs. The planner uses a combination of randomized exploration, particle filtering, and goal-seeking heuristic policies to achieve scalability to high-dimensional continuous spaces. We show that convergence is ensured in a subclass of problems where information gain rate exceeds the rate of information loss through process noise. Because these rates are not defined myopically RBSR is able to perform long open-loop information-gathering plans. The technique is demonstrated on a variety of discrete planning benchmarks as well as target-finding and localization problems in up to 7D continuous state spaces.
منابع مشابه
Execution Monitoring to Improve Plans with Information Gathering
There has been much recent interest in planning problems with deterministic actions but stochastic observations. Examples include Mars rover planning, robot monitoring tasks and the Rocksample domain from the planning competition. However, theoretical results show that in general these problems are as hard as solving partially observable Markov decision problems (POMDPs). We propose an approach...
متن کاملInformation Gathering and Reward Exploitation of Subgoals for POMDPs
Planning in large partially observable Markov decision processes (POMDPs) is challenging especially when a long planning horizon is required. A few recent algorithms successfully tackle this case but at the expense of a weaker information-gathering capacity. In this paper, we propose Information Gathering and Reward Exploitation of Subgoals (IGRES), a randomized POMDP planning algorithm that le...
متن کاملUniied Information and Control Flow in Hierarchical Task Networks
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or \sensing", or \knowledge-producing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a represen-tational problem: plans wit...
متن کاملUnified Information and Control Flow in Hierarchical Task Networks
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or "sensing", or "knowledgeproducing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a representational problem: plans with ...
متن کاملPlanner Scheduler Executor
Much recent planning research has focused on two related issues. First, there has been a strong interest in information-gathering (or \sensing", or \knowledge-producing") actions. Second, has been an investigation of plans with sophisticated control structures, such as conditional branches and loops. But the combination of these two lines of research poses a representational problem: plans with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011